NSF PAR Search | NSF Public Access Repository


Generative adversarial networks (GAN) model for dynamically adjusted weld pool image toward human-based model predictive control (MPC)

https://doi.org/10.1016/j.jmapro.2025.02.053

Li, Tianpu; Cao, Yue; Ye, Qiang; Zhang, YuMing (May 2025, Journal of Manufacturing Processes)

Gas Metal Arc Welding (GMAW) is a critical industrial technique known for its high productivity, flexibility, and adaptability to automation. Despite significant advances in robotic welding, challenges remain in fully automating the arc welding process, particularly due to the complex dynamics of the weld pool associated with GMAW. A human-robot collaborative (HRC) system in which humans operate robots may conveniently provide the adaptive control that the complex GMAW process needs. While in conventional HRC systems humans receive process feedback to make adaptive adjustments, we propose to provide humans with predictive feedback to further ease human decision-making and reduce the skill and training required. To this end, this study explores the integration of deep learning models, specifically Generative Adversarial Networks (GANs) combined with Gated Recurrent Units (GRUs), to model and predict the dynamic behavior of the weld pool during GMAW. By leveraging time-series data of torch movement and corresponding weld pool images, the proposed GRU-GAN model generates high-fidelity weld pool images, capturing the intricate relationship between speed variations and weld pool morphology. Through extensive experimentation, including the design of an acceptable Encoder-Decoder structure for the GAN, we demonstrate that incorporating both temporal and speed sequence information significantly enhances the model's predictive capabilities. The findings validate the hypothesis that dynamic torch speed adjustments, akin to those performed by skilled human welders, can be effectively modeled to improve the quality of automated welding processes. Future work will be devoted to human-based model predictive control (MPC) in an HRC environment.
Free, publicly-accessible full text available May 1, 2026
CorrGAN: Simultaneous Learning of Speech Enhancement and Perceptual Quality Loss Functions

https://doi.org/10.1109/ICASSP49660.2025.10887633

Zadorozhnyy, Vasily; Amizadeh, Saeed; Ye, Qiang; Koishida, Kazuhito (April 2025, IEEE)

Deep-learning models have enabled effective end-to-end systems in the field of Speech Enhancement (SE). Most of these methods are trained using a fixed reconstruction loss in a supervised setting. Often these losses do not perfectly represent the desired perceptual quality metrics, resulting in sub-optimal performance. Recently, there have been efforts to learn the behavior of those metrics directly via neural nets for training SE models. However, an accurate estimation of the true metric function introduces statistical complexity for training because it attempts to capture the exact value of the metric. We propose an adversarial training strategy based on statistical correlation that avoids the complexity of estimating the SE metric while learning to mimic its overall behavior. We call this framework CorrGAN and show that it significantly improves on the standard losses of state-of-the-art (SOTA) baselines and achieves SOTA performance on the VoiceBank+DEMAND dataset.
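The distinction the abstract draws, between estimating a metric's exact value and mimicking its overall behavior, can be illustrated with a small sketch. The abstract does not state CorrGAN's actual loss, so the Pearson-based `correlation_loss` below, the helper names, and the sample scores are all illustrative assumptions rather than the authors' formulation.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0.0 or var_y == 0.0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


def correlation_loss(predicted, metric):
    """Assumed loss: near zero when predictions rise and fall with the metric."""
    return 1.0 - pearson(predicted, metric)


def mse_loss(predicted, metric):
    """Value-matching loss: zero only when predictions equal the metric."""
    return sum((p - m) ** 2 for p, m in zip(predicted, metric)) / len(metric)


# Hypothetical perceptual-quality scores for four enhanced utterances.
metric_scores = [1.2, 2.5, 3.1, 4.0]
# A learned score that is only an increasing affine function of the metric.
learned_scores = [0.5 * m - 3.0 for m in metric_scores]

print("correlation loss:", correlation_loss(learned_scores, metric_scores))
print("MSE loss:", mse_loss(learned_scores, metric_scores))
```

In this toy case the learned scores never match the metric's values, so the value-matching loss stays large, yet the correlation-based loss is essentially zero because the scores track the metric's ordering and spacing.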
Free, publicly-accessible full text available April 6, 2026
Orthogonal Gated Recurrent Unit With Neumann-Cayley Transformation

https://doi.org/10.1162/neco_a_01710

Zadorozhnyy, Vasily; Mucllari, Edison; Pospisil, Cole; Nguyen, Duc; Ye, Qiang (November 2024, Neural Computation)

In recent years, using orthogonal matrices has been shown to be a promising approach to improving the training, stability, and convergence of recurrent neural networks (RNNs), particularly by controlling gradients. While gated recurrent unit (GRU) and long short-term memory (LSTM) architectures address the vanishing gradient problem by using a variety of gates and memory cells, they are still prone to the exploding gradient problem. In this work, we analyze the gradients in GRU and propose the use of orthogonal matrices to prevent exploding gradient problems and enhance long-term memory. We study where to use orthogonal matrices and propose a Neumann series–based scaled Cayley transformation for training orthogonal matrices in GRU, which we call Neumann-Cayley orthogonal GRU (NC-GRU). We present detailed experiments of our model on several synthetic and real-world tasks, which show that NC-GRU significantly outperforms GRU and several other RNNs.
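As a rough illustration of the two ingredients named above: a scaled Cayley transform, in its commonly used form W = (I + A)^{-1}(I - A)D with A skew-symmetric and D diagonal with ±1 entries, yields an orthogonal W, and the inverse can be approximated by the truncated Neumann series (I + A)^{-1} ≈ Σ_k (-A)^k, which converges when the spectral radius of A is below 1. The abstract does not say how NC-GRU applies these, so the function names, the truncation order, and the example matrix below are assumptions made for this sketch.

```python
def identity(n):
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]


def matmul(X, Y):
    """Product of two square matrices stored as lists of rows."""
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def neumann_inverse(A, order):
    """Approximate (I + A)^{-1} by the partial sum of (-A)^k for k = 0..order."""
    n = len(A)
    neg_A = [[-a for a in row] for row in A]
    term = identity(n)
    total = identity(n)
    for _ in range(order):
        term = matmul(term, neg_A)  # next power of -A
        total = [[total[i][j] + term[i][j] for j in range(n)] for i in range(n)]
    return total


def scaled_cayley(A, d, order=30):
    """Return W ~= (I + A)^{-1} (I - A) D, with D = diag(d) and d[j] in {+1, -1}."""
    n = len(A)
    eye = identity(n)
    i_minus_a = [[eye[i][j] - A[i][j] for j in range(n)] for i in range(n)]
    W = matmul(neumann_inverse(A, order), i_minus_a)
    # Right-multiplying by D scales column j by d[j].
    return [[W[i][j] * d[j] for j in range(n)] for i in range(n)]


def orthogonality_error(W):
    """Largest absolute entry of W^T W - I."""
    n = len(W)
    return max(
        abs(sum(W[k][i] * W[k][j] for k in range(n)) - (1.0 if i == j else 0.0))
        for i in range(n) for j in range(n)
    )


# Illustrative skew-symmetric matrix (A^T = -A) with spectral radius below 1.
A = [[0.0, 0.3, -0.1],
     [-0.3, 0.0, 0.2],
     [0.1, -0.2, 0.0]]

W = scaled_cayley(A, d=[1.0, -1.0, 1.0])
print("orthogonality error:", orthogonality_error(W))
```

Truncating the series too early leaves W only approximately orthogonal, so the order trades accuracy against the cost of extra matrix products.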
Full Text Available
Human-robot collaborative assembly and welding: A review and analysis of the state of the art

https://doi.org/10.1016/j.jmapro.2024.09.044

Cao, Yue; Zhou, Quan; Yuan, Wei; Ye, Qiang; Popa, Dan; Zhang, YuMing (December 2024, Journal of Manufacturing Processes)

Full Text Available
Modeling imaged welding process dynamic behaviors using Generative Adversarial Network (GAN) for a new foundation to monitor weld penetration using deep learning

https://doi.org/10.1016/j.jmapro.2024.05.081

Mucllari, Edison; Cao, Yue; Ye, Qiang; Zhang, YuMing (June 2024, Journal of Manufacturing Processes)

Full Text Available
Breaking Time Invariance: Assorted-Time Normalization for RNNs

https://doi.org/10.1007/s11063-024-11442-1

Pospisil, Cole; Zadorozhnyy, Vasily; Ye, Qiang (March 2024, Neural Processing Letters)

Methods such as Layer Normalization (LN) and Batch Normalization have proven to be effective in improving the training of Recurrent Neural Networks (RNNs). However, existing methods normalize using only the instantaneous information at one particular time step, and the result of the normalization is a preactivation state with a time-independent distribution. This implementation fails to account for certain temporal differences inherent in the inputs and the architecture of RNNs. Since these networks share weights across time steps, it may also be desirable to account for the connections between time steps in the normalization scheme. In this paper, we propose a normalization method called Assorted-Time Normalization (ATN), which preserves information from multiple consecutive time steps and normalizes using them. This setup allows us to introduce longer time dependencies into the traditional normalization methods without introducing any new trainable parameters. We present theoretical derivations for the gradient propagation and prove the weight scaling invariance property. Our experiments applying ATN to LN demonstrate consistent improvement on various tasks, such as Adding, Copying, and Denoise Problems and Language Modeling Problems.
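The abstract describes ATN only at a high level, so the sketch below is one possible reading and not the paper's formulation: layer-normalization statistics are computed over the preactivations pooled from the last `k` time steps instead of the current step alone. The function name, the window size `k`, the omission of the usual gain and bias, and the toy inputs are all assumptions.

```python
import math


def assorted_time_layer_norm(history, k, eps=1e-5):
    """Normalize the latest preactivation vector with statistics pooled
    over the last k vectors in `history` (a list of equal-length lists).

    With k = 1 this reduces to plain layer normalization without gain/bias.
    """
    if k < 1 or k > len(history):
        raise ValueError("k must be between 1 and len(history)")
    window = history[-k:]
    pooled = [v for vec in window for v in vec]
    mean = sum(pooled) / len(pooled)
    var = sum((v - mean) ** 2 for v in pooled) / len(pooled)
    scale = math.sqrt(var + eps)
    return [(v - mean) / scale for v in history[-1]]


# Toy preactivations for two consecutive time steps (hypothetical values).
preactivations = [
    [0.0, 0.0, 0.0, 0.0],  # step t - 1
    [1.0, 2.0, 3.0, 4.0],  # step t
]

plain = assorted_time_layer_norm(preactivations, k=1)
pooled = assorted_time_layer_norm(preactivations, k=2)
print("k=1 mean:", sum(plain) / len(plain))
print("k=2 mean:", sum(pooled) / len(pooled))
```

With `k = 1` the normalized vector always has zero mean, which is the time-independent distribution the abstract refers to; with `k = 2` the result depends on how the current step compares with the previous one, and no trainable parameters are added.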
SCP-GAN: Self-Correcting Discriminator Optimization for Training Consistency Preserving Metric GAN on Speech Enhancement Tasks

https://doi.org/10.21437/Interspeech.2023-456

Zadorozhnyy, Vasily; Ye, Qiang; Koishida, Kazuhito (August 2023, ISCA)

Full Text Available
Improving Deep Neural Networks’ Training for Image Classification With Nonlinear Conjugate Gradient-Style Adaptive Momentum

https://doi.org/10.1109/TNNLS.2023.3255783

Wang, Bao; Ye, Qiang (March 2023, IEEE Transactions on Neural Networks and Learning Systems)

Full Text Available
Do We Need a New Foundation to Use Deep Learning to Monitor Weld Penetration?

https://doi.org/10.1109/LRA.2023.3270038

Mucllari, Edison; Yu, Rui; Cao, Yue; Ye, Qiang; Zhang, YuMing (June 2023, IEEE Robotics and Automation Letters)

Full Text Available
Novel Molecular Representations Using Neumann-Cayley Orthogonal Gated Recurrent Unit

https://doi.org/10.1021/acs.jcim.2c01526

Mucllari, Edison; Zadorozhnyy, Vasily; Ye, Qiang; Nguyen, Duc Duy (May 2023, Journal of Chemical Information and Modeling)

Full Text Available
